SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Zabeau, Marc 

Vos, Pieter 
Simons, Guus 

(ii) TITLE OF INVENTION: RESISTANCE AGAINST WILT INDUCING FUNGI 
(iii) NUMBER OF SEQUENCES: 12

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SPENCER & FRANK 

(B) STREET: 1100 New York Avenue, N.W., Suite 300 East 

(C) CITY: Washington 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20005 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/EP96/03480

(B) FILING DATE: 06-AUG-1996 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: EP 95401849.5 

(B) FILING DATE: 07-AUG-1995 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Gollin, Michael A. 

(B) REGISTRATION NUMBER: 31,957 

(C) REFERENCE/DOCKET NUMBER: GUPLA 0008

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-414-4000 

(B) TELEFAX: 202-414-4040 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
GACTGCGTAC CAATTCNNN 19



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
GATGAGTCCT GAGTAANNN 19

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
CTCGTAGACT GCGTACC 17 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
AATTGGTACG CAGTCTAC 18



(2) INFORMATION FOR SEQ ID NO: 5:



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 16 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
GACGATGAGT CCTGAG 16

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
TACTCAGGAC TCAT 14
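
The listing gives SEQ ID NO: 3 to 6 only as separate single-stranded oligonucleotides and does not state how they are used together. As an illustrative check that is not part of the filed listing, the Python sketch below looks for the longest stretch in which one oligonucleotide is the reverse complement of another. The function names are hypothetical; the sequences are copied from the entries above.

```python
# Illustrative sketch only; function names are hypothetical, not from the listing.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (spaces ignored)."""
    return seq.replace(" ", "").upper().translate(COMPLEMENT)[::-1]


def longest_common_substring(a: str, b: str) -> str:
    """Return the longest substring of a that also occurs in b."""
    best = ""
    for i in range(len(a)):
        # Only try substrings longer than the best one found so far.
        for j in range(i + len(best) + 1, len(a) + 1):
            if a[i:j] in b:
                best = a[i:j]
            else:
                break
    return best


def longest_duplex(top: str, bottom: str) -> str:
    """Longest stretch of top that could base-pair with bottom (read 5' to 3' on top)."""
    return longest_common_substring(
        top.replace(" ", "").upper(), reverse_complement(bottom)
    )


SEQ_3 = "CTCGTAGACT GCGTACC"
SEQ_4 = "AATTGGTACG CAGTCTAC"
SEQ_5 = "GACGATGAGT CCTGAG"
SEQ_6 = "TACTCAGGAC TCAT"

print(longest_duplex(SEQ_3, SEQ_4))
print(longest_duplex(SEQ_5, SEQ_6))
```

Under these assumptions the sketch reports a 14-base complementary stretch between SEQ ID NO: 3 and 4 and a 12-base stretch between SEQ ID NO: 5 and 6. It considers exact Watson-Crick matches only and says nothing about annealing conditions.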

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
GACTGCGTAC CAATTC 16

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
GATGAGTCCT GAGTAA 16

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
GACTGCGTAC CAATTCAGA 19

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
GATGAGTCCT GAGTAATCT 19
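
SEQ ID NO: 1 and 2 end in NNN, where N conventionally stands for any of the four bases, while SEQ ID NO: 9 and 10 carry the same first 16 bases followed by three specific bases. A minimal Python sketch of that relationship is given below; the function name is hypothetical and only the symbol N is treated as a wildcard.

```python
# Illustrative sketch only; matches_degenerate is a hypothetical helper name.
def matches_degenerate(pattern: str, candidate: str) -> bool:
    """Return True if candidate is one concrete instance of pattern.

    Spaces are ignored and "N" in the pattern matches any base.
    """
    pattern = pattern.replace(" ", "").upper()
    candidate = candidate.replace(" ", "").upper()
    if len(pattern) != len(candidate):
        return False
    return all(p == "N" or p == c for p, c in zip(pattern, candidate))


SEQ_1 = "GACTGCGTAC CAATTCNNN"
SEQ_2 = "GATGAGTCCT GAGTAANNN"
SEQ_9 = "GACTGCGTAC CAATTCAGA"
SEQ_10 = "GATGAGTCCT GAGTAATCT"

print(matches_degenerate(SEQ_1, SEQ_9))
print(matches_degenerate(SEQ_2, SEQ_10))
print(matches_degenerate(SEQ_1, SEQ_10))
```

The first two calls compare each specific primer with the degenerate entry it resembles; the third compares a mismatched pair.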
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6658 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1798..5595
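
The CDS location can be cross-checked against the protein length stated for SEQ ID NO: 12 (1266 amino acids). The short Python sketch below does the arithmetic, assuming the location is 1-based and inclusive at both ends; the helper name is hypothetical.

```python
# Illustrative arithmetic check; feature_length is a hypothetical helper name.
def feature_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based feature location such as 1798..5595."""
    return end - start + 1


CDS_START, CDS_END = 1798, 5595  # from the LOCATION line of SEQ ID NO: 11
PROTEIN_LENGTH = 1266            # from the LENGTH line of SEQ ID NO: 12

n_bases = feature_length(CDS_START, CDS_END)
n_codons, leftover = divmod(n_bases, 3)
print(n_bases, n_codons, leftover)  # 3798 1266 0
```

The span covers 3798 bases, that is 1266 complete codons, which agrees with the stated protein length if the stop codon lies outside the annotated location.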



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

AAAAAAGCAG CTTTAAAAAA AGTACTTTKG AAAGGKGCTG AAACTTATTT TTTGAAATAA 60 

GCAGTTATGT GTTTGGAAWA AAAGTGCTGA AGTTGCTATG TCAAACATGA AAAGGGRAAA 120 

AATGGAAGAA AGAGWTGTTA GGGTTATGTC GTAATTTGGA GATTGTATAA AAATATTAAG 180

GGCAAAAAAA ATAAAAATGT GTCAACTTAA AACAGCTTAT AAGCTAAAAG TTAAAAGCTG 240

GGGTAGAGGT GTTTTTTTTT TTTTTAGCTT ATAAGTTGTT TTAAGTTGAC CACATTTTTA 300 

TTTTTKTTGC CCTTAATATT TTTATACAAT CTCAAAATTA CGACATAACC CTAACATCTT 360 

TTTCTCCCAT TTTTTCCTTT TCACGTTTGA CATAGCAACT TCAGCACTTT TATCCAAACA 420 

CATAACTGCT TATTTTAAAA ATAAGTTTCA GCACTTTCAA AAGTACTTTT TTAAAGCTGC 480

TTTTATTAAG CCCATCCAAA CGGGCCCTAA AATTGCTAAT GTTTGCTCTT TCTATTCTCA 540

AACTCCGTAA TATTTAAGAA AATTTGCTAA TGATAGGTCA CTTTTAACAC TAAATAATTA 600 

TAAATTGGGT AGAAATTTAT TTATCATTTT AAGCTTTTTT AATTTTGAGT CTTCTCCCTA 660 

ATTAAGACCC TTCCCCTCTT GCTTCAATTA TTTAACTGAA TAGTCTTTGT CTTATTGTTG 720 

GGTGAAAGTC TGTCTTCTTG TTAGGTACTA AGTCCTACAA TAATATCAAT AATTTGCTAT 780

GGAGAAAAAA ATATTATAGG AGAAAAATAA TTAATTTTAA TTCATGAATA TGTCTTAATA 840 

TGCAACTCAT TTTGCTTATA TATATCAAAT TAAACTCTGT TCCTTTAACT TTTTCCTATG 900 

AAGATACATT TTAATTTATT TGATGAGGTT AGTTTTGAAA TTTATATTAT AATAATGAAA 960

TGATATAACT TAAAAGAAGT TGTTTGATAT CTTATCAGAA TCATGCAGGT ACTCATAATA 1020 

TAAGAAATAA TTATGATGAA ATTTATATAT GTTTTATGCA GAGATTTATT ACGCATTGTT 1080

TACTTGGGTT ATGTATTACT TATTTCATCT TTTATCAGAA TGTAAAATTA TCATTCAATA 1140

AGAAATCCAA TTCTGTTAAA TTCAAAATAC AAACAATAAC ATTTTCAAGA CCGATTTTTT 1200 

GCCCAAGAAT ATACAGTAAA CATATTTATG ATATGGTAGG TCTCTTTAGT AATTGACCAA 1260

CAAGGATTGT GGTGGAGTGG GAAATACTCT TTAATACTTC ACCAAGAGGT CTCCAATTTG 1320

AGCCCCTGAA TACGAAATCG TCTTTGTTAG TATATACCCT AACCTAATAC AAAAATTAGT 1380

ATATTAGCCT TNACAGCTAA AATCTTTGTG ACCTGTAAGT CACGCGAGGA CAAATTTACC 1440

GTAACACCAA CTTATTCATG ATATAATTGT CCCTTTTAGC ACGGTAATAA TGAGGTGGGT 1500 

AGAAATTTAT TACTTGAGGG CCCTTTCTAC ACCCACCCTT ATTCTCTTGC TTCAATTATT 1560

GAATTGAAGA AGTAATGAAA AAACAGACTC CATTGGATAA AGGACAGTTT GCAAACACAG 1620

CTGTAACAAT TTAGAGCACT AGCAAAATAG AGAGAGTTTT GAGAGAAATT TTTGTTTGCA 1680

AATTACTCTT AACCTTCAGC AGGTAAAATA AAGTTCTTAA CTGAGACTAT TTGAAGATAT 1740

ATTTTGTTAA AGAATCATTT TGTGTGTTTC CTTGTTTTGC TTTTGCAGAT TTGAGAA 1797

ATG GAG ATT GGC TTA GCA GTT GGT GGT GCA TTT CTC TCC TCA GCT TTG 1845

Met Glu Ile Gly Leu Ala Val Gly Gly Ala Phe Leu Ser Ser Ala Leu
1 5 10 15

AAT GTT CTG TTT GAT AGG CTT GCT CCT AAC GGT GAT CTG CTC AAC ATG 1893

Asn Val Leu Phe Asp Arg Leu Ala Pro Asn Gly Asp Leu Leu Asn Met 

20 25 30 

TTT CGG AAG CAT AAG GAT CAT GTT AAG CTC TTA AAG AAG CTG AAA ATG 1941 

Phe Arg Lys His Lys Asp His Val Lys Leu Leu Lys Lys Leu Lys Met 

35 40 45 

ACT TTG CGT GGT ATT CAG ATT GTG CTA AGT GAT GCA GAG AAT AAG CAA 1989

Thr Leu Arg Gly Ile Gln Ile Val Leu Ser Asp Ala Glu Asn Lys Gln
50 55 60 

GCA TCA AAT CCA TCT GTG AGA GAC TGG CTT AAT GAG CTT CGA GAT GCT 2037 

Ala Ser Asn Pro Ser Val Arg Asp Trp Leu Asn Glu Leu Arg Asp Ala 

65 70 75 80

GTC GAC TCT GCT GAA AAT TTA ATA GAA GAA GTC AAT TAT GAA GCT TTG 2085

Val Asp Ser Ala Glu Asn Leu Ile Glu Glu Val Asn Tyr Glu Ala Leu

85 90 95 

AGG CTT AAG GTG GAA GGT CAG CAT CAG AAT TTT TCA GAA ACA AGC AAC 2133 

Arg Leu Lys Val Glu Gly Gln His Gln Asn Phe Ser Glu Thr Ser Asn

100 105 110 

CAG CAA GTA AGT GAT GAT TTT TTC CTT AAC ATA AAG GAC AAG CTG GAA 2181 

Gln Gln Val Ser Asp Asp Phe Phe Leu Asn Ile Lys Asp Lys Leu Glu
115 120 125 

GAC ACT ATT GAA ACA TTA AAG GAT TTG CAA GAG CAA ATT GGT CTC CTT 2229

Asp Thr Ile Glu Thr Leu Lys Asp Leu Gln Glu Gln Ile Gly Leu Leu
130 135 140 

GGC TTA AAG GAG TAT TTT GAT TCC ACG AAA CTA GAA ACT AGA AGA CCT 2277 

Gly Leu Lys Glu Tyr Phe Asp Ser Thr Lys Leu Glu Thr Arg Arg Pro 

145 150 155 160 

TCA ACT TCT GTG GAT GAT GAA TCT GAT ATC TTT GGT AGG CAG AGC GAA 2325 

Ser Thr Ser Val Asp Asp Glu Ser Asp Ile Phe Gly Arg Gln Ser Glu

165 170 175 

ATA GAG GAT TTG ATT GAC CGT CTA TTG TCT GAA GGT GCA AGT GGG AAA 2373

Ile Glu Asp Leu Ile Asp Arg Leu Leu Ser Glu Gly Ala Ser Gly Lys

180 185 190 

AAG CTG ACA GTA GTT CCT ATC GTT GGA ATG GGC GGC CAG GGC AAG ACA 2421

Lys Leu Thr Val Val Pro Ile Val Gly Met Gly Gly Gln Gly Lys Thr
195 200 205 

ACA CTT GCT AAA GCC GTA TAC AAT GAT GAG AGG GTG AAG AAT CAT TTT 2469

Thr Leu Ala Lys Ala Val Tyr Asn Asp Glu Arg Val Lys Asn His Phe 
210 215 220 

GAT TTG AAA GCG TGG TAT TGC GTT TCT GAA GGA TTT GAT GCT TTG AGA 2517 

Asp Leu Lys Ala Trp Tyr Cys Val Ser Glu Gly Phe Asp Ala Leu Arg 

225 230 235 240 



ATA ACA AAA GAA TTA CTC CAA GAA ATT GGC AAA TTT GAC TCG AAG GAT 2565 

Ile Thr Lys Glu Leu Leu Gln Glu Ile Gly Lys Phe Asp Ser Lys Asp

245 250 255 

GTC CAC AAC AAT CTT AAC CAG CTT CAA GTC AAA TTG AAG GAA AGT TTG 2613

Val His Asn Asn Leu Asn Gln Leu Gln Val Lys Leu Lys Glu Ser Leu

260 265 270 

AAG GGA AAG AAG TTC CTT ATT GTT TTG GAT GAT GTG TGG AAT GAA AAT 2661

Lys Gly Lys Lys Phe Leu Ile Val Leu Asp Asp Val Trp Asn Glu Asn

275 280 285 

TAC AAC GAG TGG AAT GAC TTG AGA AAT ATT TTT GCA CAA GGA GAT ATA 2709

Tyr Asn Glu Trp Asn Asp Leu Arg Asn Ile Phe Ala Gln Gly Asp Ile

290 295 300 

GGA AGT AAG ATC ATT GTG ACG ACA CGC AAA GAC AGT GTT GCC TTG ATG 2757 

Gly Ser Lys Ile Ile Val Thr Thr Arg Lys Asp Ser Val Ala Leu Met

305 310 315 320 

ATG GGA AAT GAG CAA ATT CGC ATG GGC AAT TTG TCT ACC GAA GCC TCT 2805 

Met Gly Asn Glu Gln Ile Arg Met Gly Asn Leu Ser Thr Glu Ala Ser

325 330 335 

TGG TCT TTA TTT CAA AGA CAT GCA TTT GAA AAC ATG GAT CCT ATG GGA 2853

Trp Ser Leu Phe Gln Arg His Ala Phe Glu Asn Met Asp Pro Met Gly

340 345 350 

CAT CCG GAA CTT GAA GAG GTC GGA AGA CAA ATT GCA GCC AAG TGC AAA 2901

His Pro Glu Leu Glu Glu Val Gly Arg Gln Ile Ala Ala Lys Cys Lys

355 360 365 

GGA CTG CCC TTA GCT CTG AAG ACG CTC GCT GGC ATG TTA CGC TCC AAA 2949

Gly Leu Pro Leu Ala Leu Lys Thr Leu Ala Gly Met Leu Arg Ser Lys 

370 375 380 

TCA GAG GTT GAA GAG TGG AAA CGT ATT TTG AGA AGT GAA ATA TGG GAG 2997 

Ser Glu Val Glu Glu Trp Lys Arg Ile Leu Arg Ser Glu Ile Trp Glu

385 390 395 400 

CTG CCA CAC AAT GAC ATA TTA CCA GCG TTG ATG TTG AGC TAC AAT GAT 3045

Leu Pro His Asn Asp Ile Leu Pro Ala Leu Met Leu Ser Tyr Asn Asp

405 410 415 

CTT CCC GCA CAT TTA AAG CGA TGC TTT TCT TTT TGT GCA ATA TTT CCT 3093

Leu Pro Ala His Leu Lys Arg Cys Phe Ser Phe Cys Ala Ile Phe Pro

420 425 430 

AAA GAT TAT CCA TTT AGG AAA GAA CAA GTT ATT CAT CTA TGG ATT GCC 3141 

Lys Asp Tyr Pro Phe Arg Lys Glu Gln Val Ile His Leu Trp Ile Ala

435 440 445 

AAT GGT CTC GTA CCA GTG AAA GAT GAA ATA AAT CAA GAT TTA GGC AAC 3189

Asn Gly Leu Val Pro Val Lys Asp Glu Ile Asn Gln Asp Leu Gly Asn

450 455 460 



CAA TAC TTT CTA GAG TTG AGA TCA AGA TCA TTA TTT GAA AAG GTC CCA 3237
Gln Tyr Phe Leu Glu Leu Arg Ser Arg Ser Leu Phe Glu Lys Val Pro
465 470 475 480

AAT CCT TCT AAA AGG AAC ATA GAG GAA TTA TTC CTT ATG CAT GAC CTT 3285

Asn Pro Ser Lys Arg Asn Ile Glu Glu Leu Phe Leu Met His Asp Leu

485 490 495 

GTC AAT GAT TTA GCC CAA CTT GCA TCT TCA AAA CTT TGT ATC AGG TTA 3333 

Val Asn Asp Leu Ala Gln Leu Ala Ser Ser Lys Leu Cys Ile Arg Leu

500 505 510 

GAA GAG AGC CAA GGA TCT CAT ATG TTG GAA CAA TGT CGG CAC TTA TCT 3381 

Glu Glu Ser Gln Gly Ser His Met Leu Glu Gln Cys Arg His Leu Ser
515 520 525 

TAT TCA ATA GGA TTT AAT GGT GAG TTT AAG AAA TTG ACA CCC CTC TAC 3429

Tyr Ser Ile Gly Phe Asn Gly Glu Phe Lys Lys Leu Thr Pro Leu Tyr

530 535 540 

AAA TTG GAG CAG TTG AGG ACA TTG CTT CCG ATA CGT ATT GAA TTC AGA 3477 

Lys Leu Glu Gln Leu Arg Thr Leu Leu Pro Ile Arg Ile Glu Phe Arg

545 550 555 560 

TTG CAC AAT CTA AGC AAG AGG GTG TTG CAT AAC ATA CTG CCT ACA CTA 3525 

Leu His Asn Leu Ser Lys Arg Val Leu His Asn Ile Leu Pro Thr Leu

565 570 575 

AGA TCC TTG AGG GCC CTA TCA TTC TCT CAA TAC AAG ATT AAG GAG TTG 3573

Arg Ser Leu Arg Ala Leu Ser Phe Ser Gln Tyr Lys Ile Lys Glu Leu

580 585 590 

CCA AAT GAC TTG TTT ACC AAA TTA AAG CTC CTC AGA TTT TTG GAT ATT 3621 

Pro Asn Asp Leu Phe Thr Lys Leu Lys Leu Leu Arg Phe Leu Asp Ile
595 600 605 

TCT CGG ACA TGG ATT ACA AAG TTG CCG GAT TCC ATT TGT GGA TTA TAT 3669 

Ser Arg Thr Trp Ile Thr Lys Leu Pro Asp Ser Ile Cys Gly Leu Tyr

610 615 620 

AAC TTG GAG ACA CTT CTC CTG TCA TCT TGT GCT GAT CTT GAG GAG CTA 3717 

Asn Leu Glu Thr Leu Leu Leu Ser Ser Cys Ala Asp Leu Glu Glu Leu 

625 630 635 640 

CCG CTG CAG ATG GAG AAG TTG ATT AAC TTG CGT CAT CTT GAC GTA AGC 3765

Pro Leu Gln Met Glu Lys Leu Ile Asn Leu Arg His Leu Asp Val Ser

645 650 655 

AAC ACT CGG CGC TTG AAG ATG CCA CTA CAT CTG AGC AGG TTG AAA AGC 3813 

Asn Thr Arg Arg Leu Lys Met Pro Leu His Leu Ser Arg Leu Lys Ser 

660 665 670 

CTC CAA GTG TTG GTG GGA CCC AAG TTT TTT GTA GAT GGT TGG AGA ATG 3861

Leu Gln Val Leu Val Gly Pro Lys Phe Phe Val Asp Gly Trp Arg Met
675 680 685 

GAA GAT TTG GGT GAA GCA CAA AAC TTA CAT GGA TCT CTA TCA GTT GTG 3909 

Glu Asp Leu Gly Glu Ala Gln Asn Leu His Gly Ser Leu Ser Val Val

690 695 700 

AAG TTG GAA AAT GTG GTT GAT AGA AGG GAA GCT GTG AAG GCA AAG ATG 3957

Lys Leu Glu Asn Val Val Asp Arg Arg Glu Ala Val Lys Ala Lys Met 

705 710 715 720 



AGG GAG AAG AAT CAT GTT GAG CAA TTA TCA TTG GAG TGG AGT GAA AGT 4005

Arg Glu Lys Asn His Val Glu Gln Leu Ser Leu Glu Trp Ser Glu Ser

725 730 735 

AGT ATT GCC GAC AAT TCA CAA ACA GAA AGT GAC ATA CTT GAT GAG CTA 4053

Ser Ile Ala Asp Asn Ser Gln Thr Glu Ser Asp Ile Leu Asp Glu Leu

740 745 750 

TGC CCA CAT AAA AAC ATC AAA AAA GTC GAA ATC AGT GGA TAT AGA GGG 4101 

Cys Pro His Lys Asn Ile Lys Lys Val Glu Ile Ser Gly Tyr Arg Gly

755 760 765 

ACA AAC TTT CCC AAT TGG GTA GCT GAT CCT TTG TTT CTT AAG CTG GTG 4149

Thr Asn Phe Pro Asn Trp Val Ala Asp Pro Leu Phe Leu Lys Leu Val 

770 775 780 

AAT TTG TCT CTA AGA AAC TGC AAG GAC TGT TAC TCC TTG CCA GCA CTA 4197 

Asn Leu Ser Leu Arg Asn Cys Lys Asp Cys Tyr Ser Leu Pro Ala Leu 

785 790 795 800 

GGA CAA CTC CCT TGT TTG AAA TTC CTT TCC GTT AAA GGG ATG CAT GGA 4245

Gly Gln Leu Pro Cys Leu Lys Phe Leu Ser Val Lys Gly Met His Gly

805 810 815 

ATA AGA GTG GTG ACG GAA GAA TTC TAT GGC AGA TTG TCC TCC AAA AAG 4293

Ile Arg Val Val Thr Glu Glu Phe Tyr Gly Arg Leu Ser Ser Lys Lys

820 825 830 

CCT TTT AAC TCT CTA GAG AAG CTT GAA TTT GAA GAT ATG ACG GAG TGG 4341

Pro Phe Asn Ser Leu Glu Lys Leu Glu Phe Glu Asp Met Thr Glu Trp 

835 840 845 

AAG CAA TGG CAC GCA CTA GGA ATT GGA GAG TTC CCT ACA CTT GAG AAC 4389

Lys Gln Trp His Ala Leu Gly Ile Gly Glu Phe Pro Thr Leu Glu Asn

850 855 860 

CTT TCA ATT AAA AAT TGC CCT GAG CTC AGT TTG GAG ATA CCC ATC CAA 4437

Leu Ser Ile Lys Asn Cys Pro Glu Leu Ser Leu Glu Ile Pro Ile Gln

865 870 875 880 

TTT TCA AGT TTA AAA AGG TTA GAA GTT AGT GAT TGT CCA GTT GTT TTT 4485

Phe Ser Ser Leu Lys Arg Leu Glu Val Ser Asp Cys Pro Val Val Phe 

885 890 895 

GAT GAT GCC CAA CTG TTT AGA TCC CAA CTT GAG GCA ATG AAG CAG ATT 4533

Asp Asp Ala Gln Leu Phe Arg Ser Gln Leu Glu Ala Met Lys Gln Ile

900 905 910 

GAG GAA ATA GAT ATA TGT GAT TGT AAC TCT GTT ACC TCC TTT CCT TTT 4581

Glu Glu Ile Asp Ile Cys Asp Cys Asn Ser Val Thr Ser Phe Pro Phe

915 920 925 

AGC ATA CTG CCA ACT ACC TTG AAG AGA ATA CAG ATA TCT CGT TGC CCA 4629

Ser Ile Leu Pro Thr Thr Leu Lys Arg Ile Gln Ile Ser Arg Cys Pro

930 935 940 

AAA TTG AAA TTG GAG GCG CCA GTT GGT GAG ATG TTT GTG GAG TAT TTG 4677

Lys Leu Lys Leu Glu Ala Pro Val Gly Glu Met Phe Val Glu Tyr Leu

945 950 955 960

AGA GTG AAT GAT TGT GGT TGT GTA GAT GAT ATA TCA CCT GAG TTT CTC 4725

Arg Val Asn Asp Cys Gly Cys Val Asp Asp Ile Ser Pro Glu Phe Leu

965 970 975 

CCA ACA GCA CGT CAA TTG AGT ATT GAA AAT TGC CAG AAC GTT ACT AGG 4773 

Pro Thr Ala Arg Gln Leu Ser Ile Glu Asn Cys Gln Asn Val Thr Arg

980 985 990 

TTT TTG ATT CCT ACT GCC ACT GAA ACT CTC CGT ATT TCG AAT TGT GAG 4821

Phe Leu Ile Pro Thr Ala Thr Glu Thr Leu Arg Ile Ser Asn Cys Glu
995 1000 1005 

AAT GTT GAA AAA CTA TCG GTG GCA TGT GGA GGA GCG GCC CAG ATG ACG 4869

Asn Val Glu Lys Leu Ser Val Ala Cys Gly Gly Ala Ala Gln Met Thr
1010 1015 1020 

TCA CTG AAT ATT TGG GGA TGT AAG AAG CTC AAG TGT CTT CCA GAA CTC 4917

Ser Leu Asn Ile Trp Gly Cys Lys Lys Leu Lys Cys Leu Pro Glu Leu

1025 1030 1035 1040 

CTT CCA TCT CTC AAG GAA CTG CGT CTG TCT GAT TGT CCA GAA ATA GAA 4965

Leu Pro Ser Leu Lys Glu Leu Arg Leu Ser Asp Cys Pro Glu Ile Glu

1045 1050 1055 

GGA GAA TTG CCC TTC AAT TTA GAA ATA CTC CGT ATC ATA TAT TGC AAG 5013 

Gly Glu Leu Pro Phe Asn Leu Glu Ile Leu Arg Ile Ile Tyr Cys Lys

1060 1065 1070 

AAA CTG GTG AAT GGC CGA AAG GAG TGG CAT TTA CAG AGA CTC ACA GAG 5061

Lys Leu Val Asn Gly Arg Lys Glu Trp His Leu Gln Arg Leu Thr Glu
1075 1080 1085 

TTA TGG ATC GAT CAT GAT GGG AGT GAC GAA GAT ATT GAA CAT TGG GAG 5109

Leu Trp Ile Asp His Asp Gly Ser Asp Glu Asp Ile Glu His Trp Glu
1090 1095 1100 

TTG CCT TGT TCT ATT CAG AGA CTT ACC ATA AAG AAT CTT AAA ACA TTA 5157 

Leu Pro Cys Ser Ile Gln Arg Leu Thr Ile Lys Asn Leu Lys Thr Leu

1105 1110 1115 1120 

AGC AGC CAA CAT CTC AAA AGC CTC ACC TCT CTT CAA TAT CTA TGT ATT 5205 

Ser Ser Gln His Leu Lys Ser Leu Thr Ser Leu Gln Tyr Leu Cys Ile

1125 1130 1135 

GAG GGT TAT TTA TCT CAG ATT CAG TCA CAA GGC CAG CTT TCC TCC TTT 5253 

Glu Gly Tyr Leu Ser Gln Ile Gln Ser Gln Gly Gln Leu Ser Ser Phe

1140 1145 1150 

TCT CAC CTC ACT TCG CTT CAA ACT CTA CAA ATC TGG AAT TTC CTT AAT 5301 

Ser His Leu Thr Ser Leu Gln Thr Leu Gln Ile Trp Asn Phe Leu Asn
1155 1160 1165 

CTC CAA TCA CTT GCT GAA TCA GCA CTG CCC TCC TCC CTC TCT CAC CTG 5349

Leu Gln Ser Leu Ala Glu Ser Ala Leu Pro Ser Ser Leu Ser His Leu
1170 1175 1180 

GAG ATA GAT GAT TGC CCT AAT CTC CAA TCA CTC TTC GAA TCA GCA CTG 5397

Glu Ile Asp Asp Cys Pro Asn Leu Gln Ser Leu Phe Glu Ser Ala Leu

1185 1190 1195 1200 



CCC TCC TCC CTC TCT CAG CTG TTC ATC CAG GAT TGC CCT AAT CTC CAA 5445
Pro Ser Ser Leu Ser Gln Leu Phe Ile Gln Asp Cys Pro Asn Leu Gln

1205 1210 1215

TCC CTT CCA TTT AAA GGG ATG CCC TCT TCC CTC TCT AAA CTA TCT ATT 5493
Ser Leu Pro Phe Lys Gly Met Pro Ser Ser Leu Ser Lys Leu Ser Ile

1220 1225 1230 

TTC AAT TGC CCA TTG CTC ACA CCA CTA CTA GAA TTT GAC AAG GGG GAA 5541 
Phe Asn Cys Pro Leu Leu Thr Pro Leu Leu Glu Phe Asp Lys Gly Glu 
1235 1240 1245 

TAC TGG CCA CAA ATT GCT CAT ATT CCC ATC ATA AAT ATC GAT TGG AAA 5589
Tyr Trp Pro Gln Ile Ala His Ile Pro Ile Ile Asn Ile Asp Trp Lys
1250 1255 1260 

TAT ATT TAACAATTAA AACAAATGGC TCTCCAACTG ATGTAAGCTA TTCGTTACCC 5645

Tyr Ile

1265 

TTAGAAGCTT TTTATGATTC TATGTTTCTC ATTGCTTATT GGTTTATGCT CTTACCGTGT 5705

TTTAATTCAC GTCTCAATTG CCACCATGTT TAATCGAAAG TTTTTAGTTC TTGTAATCAT 5765

CAACCATCCT ATGTCACTAG AAATTTTGAT AGGTAAAAGA GGTAGACAAA AAAGCTAAAC 5825

ATCTTTTTTC TTTCGTATAG CGACCAAACA ACTACATTTT GATAGGTAAG GGCTATAGAT 5885

ATACATTTGC AGGGTGTTAA ACCAAGGAGT AAGAAAATCA CTGTCTTCAG ATATCTTCTC 5945

TTGCATATAC TTTTGCAATT TTAAGCTACA TTTTGAACTC ATGTGTTGTT GCTAACTTAA 6005 

ACATGTTTTG TGCTTAATCA GATGTGGATT TTGAAGAGCG AGTACGACAA GTCTGGTACA 6065

TTAATTGTCC GTAGAAGTGT TTCTAAGGTG CTGCTGCTAT TTTTACATCT GTTCCCGAGT 6125 

TTTGTTTTTT TTTTAAATCT TTCCACTAAA GCTATTATGT CGTCCACAGT GAATTTTCAG 6185

GTCTGTTGTT ATAGGCAAGT CTTTGAGATG CGACTATCAA AGAAGGGCGA TTACAATCAG 6245

TGTACCGCTG AAACTATTTC ATGTTTCCAG TGCAAGCCTC TTTTGTAAGT TGACAAACTC 6305 

GATTAGTTAA TATGTTTGGG ACTCAACTAG TGGTTAGAGT ACTCATTTTG TAAGACTTGT 6365 

GTACAGAAAA TCAAATTAGA ATTATAACTC GTGATGGTTG AATAAACTCT AAGAAGTACT 6425

GATATATTTT TTAGTGGATA TGTTGTTTGC TCATTCGGTG TTTGATATCC ACATTGGAGT 6485

CCAACTAAAT TCGAATTTGC ACAATCGAAG GAGCGGTGCT CCTGGCATGA TTTTTTTCCC 6545

ATTCTACGAC TAGTGCTCCT AAATTCTAAT TAAGCATAGA AAAATCTCAA CTATCTCACC 6605 

CAACTCATAT CAGGATAGAG TATTCCCTGA GGAGGATTCC TTCAGTTACA AAA 6658 
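
The three-letter amino acid codes printed under the codons of SEQ ID NO: 11 can be spot-checked by translating a row with the standard genetic code. The Python sketch below does this for the first codon row and for the last two codons plus the three bases that follow position 5595. It assumes the standard code applies; the table layout and function name are choices made here, and "Ter" is used as a label for a stop codon.

```python
# Illustrative sketch: standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",  # label chosen here for a stop codon
}


def translate(codons: str) -> list[str]:
    """Translate a DNA string (spaces ignored) into three-letter residue codes."""
    seq = codons.replace(" ", "").upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    return [THREE_LETTER[CODON_TABLE[seq[i:i + 3]]] for i in range(0, len(seq), 3)]


FIRST_ROW = "ATG GAG ATT GGC TTA GCA GTT GGT GGT GCA TTT CTC TCC TCA GCT TTG"
LAST_CODONS = "TAT ATT TAA"  # last two codons of the CDS plus the next three bases

print(" ".join(translate(FIRST_ROW)))
print(" ".join(translate(LAST_CODONS)))
```

Ambiguity symbols such as N, K, W or R are not handled by this table and would raise a KeyError, so the sketch is only suitable for the unambiguous coding region.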



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1266 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Glu Ile Gly Leu Ala Val Gly Gly Ala Phe Leu Ser Ser Ala Leu
1 5 10 15

Asn Val Leu Phe Asp Arg Leu Ala Pro Asn Gly Asp Leu Leu Asn Met
20 25 30

Phe Arg Lys His Lys Asp His Val Lys Leu Leu Lys Lys Leu Lys Met
35 40 45

Thr Leu Arg Gly Ile Gln Ile Val Leu Ser Asp Ala Glu Asn Lys Gln
50 55 60

Ala Ser Asn Pro Ser Val Arg Asp Trp Leu Asn Glu Leu Arg Asp Ala
65 70 75 80

Val Asp Ser Ala Glu Asn Leu Ile Glu Glu Val Asn Tyr Glu Ala Leu
85 90 95

Arg Leu Lys Val Glu Gly Gln His Gln Asn Phe Ser Glu Thr Ser Asn
100 105 110

Gln Gln Val Ser Asp Asp Phe Phe Leu Asn Ile Lys Asp Lys Leu Glu
115 120 125

Asp Thr Ile Glu Thr Leu Lys Asp Leu Gln Glu Gln Ile Gly Leu Leu
130 135 140

Gly Leu Lys Glu Tyr Phe Asp Ser Thr Lys Leu Glu Thr Arg Arg Pro
145 150 155 160

Ser Thr Ser Val Asp Asp Glu Ser Asp Ile Phe Gly Arg Gln Ser Glu
165 170 175

Ile Glu Asp Leu Ile Asp Arg Leu Leu Ser Glu Gly Ala Ser Gly Lys
180 185 190

Lys Leu Thr Val Val Pro Ile Val Gly Met Gly Gly Gln Gly Lys Thr
195 200 205

Thr Leu Ala Lys Ala Val Tyr Asn Asp Glu Arg Val Lys Asn His Phe
210 215 220

Asp Leu Lys Ala Trp Tyr Cys Val Ser Glu Gly Phe Asp Ala Leu Arg
225 230 235 240

Ile Thr Lys Glu Leu Leu Gln Glu Ile Gly Lys Phe Asp Ser Lys Asp
245 250 255

Val His Asn Asn Leu Asn Gln Leu Gln Val Lys Leu Lys Glu Ser Leu
260 265 270

Lys Gly Lys Lys Phe Leu Ile Val Leu Asp Asp Val Trp Asn Glu Asn
275 280 285

Tyr Asn Glu Trp Asn Asp Leu Arg Asn Ile Phe Ala Gln Gly Asp Ile
290 295 300






Gly Ser Lys Ile Ile Val Thr Thr Arg Lys Asp Ser Val Ala Leu Met
305 310 315 320

Met Gly Asn Glu Gln Ile Arg Met Gly Asn Leu Ser Thr Glu Ala Ser
325 330 335

Trp Ser Leu Phe Gln Arg His Ala Phe Glu Asn Met Asp Pro Met Gly
340 345 350

His Pro Glu Leu Glu Glu Val Gly Arg Gln Ile Ala Ala Lys Cys Lys
355 360 365

Gly Leu Pro Leu Ala Leu Lys Thr Leu Ala Gly Met Leu Arg Ser Lys
370 375 380

Ser Glu Val Glu Glu Trp Lys Arg Ile Leu Arg Ser Glu Ile Trp Glu
385 390 395 400

Leu Pro His Asn Asp Ile Leu Pro Ala Leu Met Leu Ser Tyr Asn Asp
405 410 415

Leu Pro Ala His Leu Lys Arg Cys Phe Ser Phe Cys Ala Ile Phe Pro
420 425 430

Lys Asp Tyr Pro Phe Arg Lys Glu Gln Val Ile His Leu Trp Ile Ala
435 440 445

Asn Gly Leu Val Pro Val Lys Asp Glu Ile Asn Gln Asp Leu Gly Asn
450 455 460

Gln Tyr Phe Leu Glu Leu Arg Ser Arg Ser Leu Phe Glu Lys Val Pro
465 470 475 480

Asn Pro Ser Lys Arg Asn Ile Glu Glu Leu Phe Leu Met His Asp Leu
485 490 495

Val Asn Asp Leu Ala Gln Leu Ala Ser Ser Lys Leu Cys Ile Arg Leu
500 505 510

Glu Glu Ser Gln Gly Ser His Met Leu Glu Gln Cys Arg His Leu Ser
515 520 525

Tyr Ser Ile Gly Phe Asn Gly Glu Phe Lys Lys Leu Thr Pro Leu Tyr
530 535 540

Lys Leu Glu Gln Leu Arg Thr Leu Leu Pro Ile Arg Ile Glu Phe Arg
545 550 555 560

Leu His Asn Leu Ser Lys Arg Val Leu His Asn Ile Leu Pro Thr Leu
565 570 575

Arg Ser Leu Arg Ala Leu Ser Phe Ser Gln Tyr Lys Ile Lys Glu Leu
580 585 590

Pro Asn Asp Leu Phe Thr Lys Leu Lys Leu Leu Arg Phe Leu Asp Ile
595 600 605

Ser Arg Thr Trp Ile Thr Lys Leu Pro Asp Ser Ile Cys Gly Leu Tyr
610 615 620

Asn Leu Glu Thr Leu Leu Leu Ser Ser Cys Ala Asp Leu Glu Glu Leu
625 630 635 640



Pro Leu Gln Met Glu Lys Leu Ile Asn Leu Arg His Leu Asp Val Ser
645 650 655

Asn Thr Arg Arg Leu Lys Met Pro Leu His Leu Ser Arg Leu Lys Ser
660 665 670

Leu Gln Val Leu Val Gly Pro Lys Phe Phe Val Asp Gly Trp Arg Met
675 680 685

Glu Asp Leu Gly Glu Ala Gln Asn Leu His Gly Ser Leu Ser Val Val
690 695 700

Lys Leu Glu Asn Val Val Asp Arg Arg Glu Ala Val Lys Ala Lys Met
705 710 715 720

Arg Glu Lys Asn His Val Glu Gln Leu Ser Leu Glu Trp Ser Glu Ser
725 730 735

Ser Ile Ala Asp Asn Ser Gln Thr Glu Ser Asp Ile Leu Asp Glu Leu
740 745 750

Cys Pro His Lys Asn Ile Lys Lys Val Glu Ile Ser Gly Tyr Arg Gly
755 760 765

Thr Asn Phe Pro Asn Trp Val Ala Asp Pro Leu Phe Leu Lys Leu Val
770 775 780

Asn Leu Ser Leu Arg Asn Cys Lys Asp Cys Tyr Ser Leu Pro Ala Leu
785 790 795 800

Gly Gln Leu Pro Cys Leu Lys Phe Leu Ser Val Lys Gly Met His Gly
805 810 815

Ile Arg Val Val Thr Glu Glu Phe Tyr Gly Arg Leu Ser Ser Lys Lys
820 825 830

Pro Phe Asn Ser Leu Glu Lys Leu Glu Phe Glu Asp Met Thr Glu Trp
835 840 845

Lys Gln Trp His Ala Leu Gly Ile Gly Glu Phe Pro Thr Leu Glu Asn
850 855 860

Leu Ser Ile Lys Asn Cys Pro Glu Leu Ser Leu Glu Ile Pro Ile Gln
865 870 875 880

Phe Ser Ser Leu Lys Arg Leu Glu Val Ser Asp Cys Pro Val Val Phe
885 890 895

Asp Asp Ala Gln Leu Phe Arg Ser Gln Leu Glu Ala Met Lys Gln Ile
900 905 910

Glu Glu Ile Asp Ile Cys Asp Cys Asn Ser Val Thr Ser Phe Pro Phe
915 920 925

Ser Ile Leu Pro Thr Thr Leu Lys Arg Ile Gln Ile Ser Arg Cys Pro
930 935 940

Lys Leu Lys Leu Glu Ala Pro Val Gly Glu Met Phe Val Glu Tyr Leu
945 950 955 960


Arg Val Asn Asp Cys Gly Cys Val Asp Asp Ile Ser Pro Glu Phe Leu
965 970 975

Pro Thr Ala Arg Gln Leu Ser Ile Glu Asn Cys Gln Asn Val Thr Arg
980 985 990

Phe Leu Ile Pro Thr Ala Thr Glu Thr Leu Arg Ile Ser Asn Cys Glu
995 1000 1005

Asn Val Glu Lys Leu Ser Val Ala Cys Gly Gly Ala Ala Gln Met Thr
1010 1015 1020

Ser Leu Asn Ile Trp Gly Cys Lys Lys Leu Lys Cys Leu Pro Glu Leu
1025 1030 1035 1040

Leu Pro Ser Leu Lys Glu Leu Arg Leu Ser Asp Cys Pro Glu Ile Glu
1045 1050 1055

Gly Glu Leu Pro Phe Asn Leu Glu Ile Leu Arg Ile Ile Tyr Cys Lys
1060 1065 1070

Lys Leu Val Asn Gly Arg Lys Glu Trp His Leu Gln Arg Leu Thr Glu
1075 1080 1085

Leu Trp Ile Asp His Asp Gly Ser Asp Glu Asp Ile Glu His Trp Glu
1090 1095 1100

Leu Pro Cys Ser Ile Gln Arg Leu Thr Ile Lys Asn Leu Lys Thr Leu
1105 1110 1115 1120

Ser Ser Gln His Leu Lys Ser Leu Thr Ser Leu Gln Tyr Leu Cys Ile
1125 1130 1135

Glu Gly Tyr Leu Ser Gln Ile Gln Ser Gln Gly Gln Leu Ser Ser Phe
1140 1145 1150

Ser His Leu Thr Ser Leu Gln Thr Leu Gln Ile Trp Asn Phe Leu Asn
1155 1160 1165

Leu Gln Ser Leu Ala Glu Ser Ala Leu Pro Ser Ser Leu Ser His Leu
1170 1175 1180

Glu Ile Asp Asp Cys Pro Asn Leu Gln Ser Leu Phe Glu Ser Ala Leu
1185 1190 1195 1200

Pro Ser Ser Leu Ser Gln Leu Phe Ile Gln Asp Cys Pro Asn Leu Gln
1205 1210 1215

Ser Leu Pro Phe Lys Gly Met Pro Ser Ser Leu Ser Lys Leu Ser Ile
1220 1225 1230

Phe Asn Cys Pro Leu Leu Thr Pro Leu Leu Glu Phe Asp Lys Gly Glu
1235 1240 1245

Tyr Trp Pro Gln Ile Ala His Ile Pro Ile Ile Asn Ile Asp Trp Lys
1250 1255 1260

Tyr Ile
1265



